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Abstract. Cells often have tens of thousands of receptors, even though only a few 
activated receptors can trigger full cellular responses. Reasons for the overabundance 
of receptors remain unclear. We suggest that, in certain conditions, the large number 
of receptors results in a competition among receptors to be the first to activate the cell. 
The competition decreases the variability of the time to cellular activation, and hence 
results in a more synchronous activation of cells. We argue that, in simple models, 
this variability reduction does not necessarily interfere with the receptor specificity to 
ligands achieved by the kinetic proofreading mechanism. Thus cells can be activated 
accurately in time and specifically to certain signals. We predict the minimum number 
of receptors needed to reduce the coefficient of variation for the time to activation 
following binding of a specific ligand. Further, we predict the maximum number of 
receptors so that the kinetic proofreading mechanism still can improve the specificity 
of the activation. These predictions fall in line with experimentally reported receptor 
numbers for multiple systems. 

PACS numbers: 87.16.Xa, 87.17.Aa, 87.15.R 
Keywords: first passage time, cooperativity, noise suppression, kinetic proofreading, 
specificity, spare receptors 

1. Introduction 

Cellular responses to changes in the surrounding world are mediated by protein signaling 
pathways. These pathways are typically activated by modification of cell surface 
receptors. For example, receptors on the surface of immune cells are a key component 
in detecting pathogens and activating cellular proliferation, degranulation, and other 
immune responses. The number of such receptors on the cell surface typically is 
lO"^ . . . 10^ P, El [3]|f] Surprisingly, sometimes as few as a hundred or even fewer bound 

I These and subsequent numbers have to be taken with a grain of salt since, first of all, they represent 
a compilation of different systems in different organisms and cell lines. Secondly, a variety of processes 
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receptors are needed to fully activate a cell. For example, T-cells can be activated by 
1. . . 10 receptors at the T-cell-B-cell synapse [lOl [11]. Rat basophilic leukemia cells 
require fewer than 5% active Fce-RI receptors to degranulate [I2]. This is not unique 
to immune receptors. For example, cellular response can be triggered by only a few 
active estrogen receptors [13] . This phenomenon of cells having an excessive number of 
receptors compared to what is needed for activation has been known for a while as a 
problem of "spare receptors" [HI [T3] 

Functional importance of this overabundance of receptors is not understood, and 
several explanations can be considered. One possibility is that the chemical information 
may not be distributed uniformly in space, requiring receptors at multiple locations on 
the cell surface. An example is the T-cell, which is commonly activated by antigen- 
presenting cells through a localized synapse [TU]. Alternatively, the number of ligands 
may be very low, and then the probability of receptor-ligand binding is increased by 
having many receptors even in the well-mixed chemical kinetics limit [lOj. Finally, large 
number of receptors facilitates clustering, promoting collective response, which may lead 
to signal amplification [TTl |T8] . 

Here we suggest that, in an alternative regime of a high ligand concentration, the 
large number of receptors may play an additional role. For example, for biological 
processes such as the immune response, synchronous and short cellular activation 
times would allow a concerted defense against an infection, and may be functionally 
advantageous. Similarly, concerted response is important in growth receptor signaling. 
We suggest that such improvement in the accuracy of the activation time may be 
achieved through a competition among many receptors. 

We envision a receptor that, when activated, produces an active messenger 
molecule. A few, and maybe just one, of such molecules are sufficient to start 
transcription or otherwise activate the whole cell. Each receptor takes an extended 
and variable amount of time to activate. Then the first few receptors out of many that 
have achieved full activation will activate the cell. The variability of the activation time 
of these fastest receptors can be smaller than that of a typical receptor. To build an 
intuition, think about the distribution of finish times for runners in a marathon. In 
this analogy, each cell is a marathon with many receptors representing runners. They 
all start at about the same time, launched into the activation race by an abundant 
ligand, and activated receptors are represented by runners who have finished the race. 
In large marathons with thousands of participants, the time it takes to reach the finish 
line for the first few runners is much smaller than the mean of the finishing times. More 
importantly, for an individual marathon, the variability in finish times among all the 

complicate the picture. For example, prior ligand exposure history may influence the number of 
receptors and of associated kinases, hence controlling the system's gain [H [5]. Further, receptors 
may form aggregates that activate collectively, so that the ability of a receptor to activate a cell is a 
function of its physical context [6l[71[8]. Finally, serial engagement allows multiple receptor (and cell) 
activations by the same ligand molecule [9] . Nonetheless, we believe these numbers to be correct within 
an order of magnitude, and hence illustrative. 
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runners can be fairly large, of order of several hours. And yet the distribution of times of 
the winners of many marathons has the variability of the order of minutes. In a similar 
manner, the receptors that are activated first in different cells might have activation 
times with small variance, leading to a more synchronous response. 

In this work, we approach the problem in the context of simplified receptor models. 
The receptor activation is first modeled as a linear chain of L states, such as ligand 
binding, conformational changes, dimerization, etc. Ligand presentation starts the 
progression along the chain, and the final state represents receptor activation. With 
this model, the probability distribution of time to activation of the first receptor out 
of receptors, with ^ 1, is given by a Weibull distribution, which has a small 
coefficient of variation. When m (1 ^ m ^ A^) receptors are needed to activate the 
cell, the coefficient of variation decreases even further. This "racing to activation" 
results in a short and less variable time to activation. 

Finally, we point out that, to activate specifically to a certain ligand, cells often 
use kinetic proofreading [191 EQl El]. We show that a large number of receptors 
does not necessarily interfere with the ability to proofread. It enables cells to be 
activated temporally precisely, but also specifically to certain ligands. Using biologically 
reasonable parameter values, we predict the number of receptors that allows for both of 
these properties. 



Multistep Activation 
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Figure 1. Schematic description of the model, (a) L irreversible steps with rates k 
are needed to activate a receptor, (b) Kinetic proofreading, denoted by the backwards 
transitions with the rate 7, allows the receptor preferentially by a correct ligand. 

2. Multistep Activation 

A simplified model of receptor activation consists of a linear chain of L irreversible 
events where the receptor undergoes biochemical transformations that ultimately lead 
to activation, see Fig. [T^. In this figure, the multistep activation process begins at the 
diamond (site i = 0), and at each site the system can transit one step towards activation 
with a forward rate k. Reaching the right-most site, i = L, will lead to the receptor 
activation irreversible on the time scales considered. In order to simplify the algebraic 
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expressions, we assume an equal forward rate for all sites. This simplification may 
reduce the generality of our results somewhat, but it does not change their dependence 
on L, provided none of the rates k are small enough to form a bottleneck. 

The probability distribution of the time to complete each single step in the process 
of the multistep activation is an exponential distribution with rate k 

p{t) = kexp{-kt). (1) 

The mean time is /i = l/fc, the variance is = l/k"^, and the coefficient of variation 
is c.v. = 1. For the whole process with L steps, the probability distribution of time to 
activation is the well known F distribution 

P^it) = Jfl^/^M-kt)t'^'' ■ (2) 

This distribution has a broad peak. An example of the F-distribution with L = 16 and 
/c = 1 is shown in Fig. |2} The moments of the F distribution are readily available: 

fi = L/k, (3) 
a' =L/k\ (4) 
c.v. = (5) 

The observation that multistep chemical reactions reduce the coefficient of variation of 
activation time, has been well understood computationally and experimentally, most 
notably in the context of vertebrate vision [22j. 




Figure 2. The probability distributions of time to activation with no kinetic 
proofreading. We use L = 16, /c = 1, TV = 30,000, and m — 5. The distribution 
is shown for a single receptor (dashed line), the first receptor out of N (solid), and the 
first m receptors out of N (dash-dotted). 

The decrease of the coefficient of variation in proportion to l/y/L holds true for 
a single receptor. However, a cell usually has tens of thousands of receptors, ^ 1. 
We are interested in understanding how this large number affects the mean and the 
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variability of the activation times. The activation time distribution for the first receptor 
out of receptors is 



P^Nit) = N P^it) 1 



Pi{t')dt' 



N-l 



(6) 



where -Pi(t) is the F distribution shown in Eq. ([2]). Here -Pi(t) stands for the probabihty 
of activating the first receptor at exactly the time t , and there are choices of this 
receptor. The last term represents that the other — 1 receptors must not be activated 
before t. For N 3> 1, -Pi|Ar can be simplified since it is almost certain that the first 
receptor is activated at the left tail of the F distribution [t <C although a typical 



receptor takes a much longer time to be activated. Thus, the distribution Pii^v 
rewritten as 

Pi\Nit) ^ - ( - ] exp ' 



where a = (]^r) ^ ■ Pi\n is the Weibull distribution, which we illustrate in Fig. 
Weibull distribution has the following well-known statistical properties 

r 



cr 



c.v. 
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a 



L 



F 1 + 



L 



1.28 



can be 



(7) 
The 

(8) 
(9) 
(10) 



L L ' 

where T{x) is the F function and ip^^\x) is a polygamma function defined as ip^^\x) = 
^lnF(a;). The coefficient of variation goes as oc which decreases much faster with 
L than it does for a single receptor activation, -j=. 

One can envision a situation where a cooperative action of a few receptors, or 
more precisely of their activated messages, is needed to activate the whole cell. We 
can extend our model by studying the distribution of time to activating m out of 
receptors, Pm\Nit)i with 1 ^ m -C A^. There are (^) possibilities to choose the set 
of m receptors to be activated out of total A^ receptors. We require that one of the 
m receptors activates at exactly time t, which has a probability of mPi(t). The other 
m — 1 receptors will finish the process at some point before time t which is described 

m receptors are not activated up to time t. 



by [j;,Pi{t')dt' 
Combining the 



m— 1 



m\N 



. The rest of the A^ 
terms, we get 



-m P^{t) 



m\ {N — m) 
With the same approximation t <^ 

Pm\N{t) 



Pl{t')dt' 



m—1 



1 - [ Pi{t')dt' 

Jo 



L-1 
k ■ 



Pm\N{t) can be rewritten as 



ml (N — m) 



;m Pi{t) 



L\ 
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L\ 



(12) 
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Mean time 


Variance 


c.v. 


example of c.v. 


1 receptor 


L 


L 


1 


25.0% 


1 receptors 






L 


8.0% 


m 


\N receptors 


1 






1 


2.8% 


(m-l)! 




7= 

Ly/m 



Table 1. Comparison of activation time distribution for three different activation 
processes. The example uses the values L = 16, k ^ 1, 171 — 5, and N — 30, 000 



This distribution is shown in Fig. [2] with the usual parameter values, and m = 5. 
Activating multiple receptors has a narrower distribution with a smaller coefficient of 
variation. We calculate 

O.V. « ^« « 1 . (15) 

where the last approximation comes from the first few terms in the asymptotic expansion 
of ip^lm) 1/m — l/2m? + l/24m^, assuming 1. Thus the coefficient of variation 
for cooperative activation is further reduced compared to the activation by the ffist 
receptor. 

The results of the three discussed cases (single receptor, the ffist out of many 
receptors, and the ffist m out of many receptors needed for activation) are summarized 
in Table [Tj The racing to activation mechanism with a large number of participating 
receptors can reduce the coefficient of variation of the activation time to below 5% for 
reasonable parameters. This could mean a more synchronous activation of cells. 



3. Multistep activation with kinetic proofreading 

Receptors can bind many non-specific ligands, and they employ kinetic proofreading 
to increase specificity and activate predominantly following binding of a specific ligand 
[211123]. Our model of kinetic proofreading follows Ref. [21], and is shown in Fig. We 
assume that there are two branches of sequential events in the receptor state diagram: 
a correct branch and a wrong one. The two branches represent activation sequences 
following binding of a specific/nonspecific ligand molecules, with rates ki and /c2 for 
the correct /wrong branches, respectively. For both branches, kinetic proofreading is 
represented by possible jumps back to the initial state with rates 71 and 72, respectively. 
These jumps can correspond to ligand unbinding from the receptor, followed by return of 
the receptor into the initial, fully inactive state. If 72 ^ 71 (that is, a nonspecific ligand 
unbinds faster than the specific one), then the receptor has a much lower probability 
to reach the final, absorbing state along the wrong branch than along the correct one, 
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hence increasing the specificity. While this model is crude, it nonetheless captures the 
basic physical properties of the kinetic proofreading process. 

Kinetic proofreading typically leads to a near-exponential distribution of completion 
times [2U [25], which has c.v. 1. The distribution is different from the exponential 
only over a very short initial time period since it is difficult to traverse the sequence of L 
steps to activation quickly, see Fig. [3) This has a potential of interfering with the racing 
to activation mechanism for improving temporal precision. Indeed, a receptor with an 
exponentially distributed activation time is equivalent to a single-step receptor, L = 1. 
In this case the corresponding F and the WeibuU distributions are exponential as well, 
providing no reduction in the coefficient of variation. 

However, if the number of receptors is sufficiently large, then the first few receptors 
will have a substantial chance to activate during the initial, nonexponential phase of the 
completion time distribution. That is, they will activate quickly in a linear sequence 
of events, as in the previous Section, having no time to revert back to the initial state 
through the proofreading mechanism. Thus if the number of receptors is sufficient 
to complete so quickly along the correct branch, the temporal precision will increase 
through the racing to activation mechanism. On the other hand, since 72 is larger, 
the receptor trying to activate over the same time along the wrong branch may have 
sufficient time to revert to the initial state repeatedly, falling into the exponential part 
of the completion time distribution. This suggests that the number of receptors on 
the cell surface should also be limited from above, so that wrong activation has a low 
probability of happening in the pre-exponential part of the activation distribution. 

Mathematically, we can summarize these arguments as follows. Following Ref . [25] , 
we approximate the probability distribution of completing along a single branch of the 
state space as: 

Pit) - ^^^^^, t'"^exp[-(7 + A:)t], t < t^, (16) 
P(t) ^ -exp (-- ) , t>tc- (17) 



Here the short-time limit starts as the F-distribution, and there is an additional decrease 
in probability of completing at time t by exp(— 7^) because of kinetic proofreading. This 
approximation is valid to the left of its maximum, 
L-l 

tc = T—- 18 

A; + 7 

Further, yU is the mean activation time, calculated in Ref. [25] to be 
1 



/X = - 

7 



1 + ?) -1 



(19) 



The approximation is illustrated in Fig. [3] 

The variance reduction along the correct branch requires that the first receptors 
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Figure 3. The probability distribution of time to activation with kinetic proofreading 
for a single branch with L = 16, A; = 1, and 7 — 0.6. The vertical dotted line is the t of 
maximum probability. On left side of this line, the curves are in log-log scale; while the 
curves on the right are in log-linear scale. Solid line comes from a numerical solution 
of the master equation describing the receptor. We see the power law distribution 
when t is small, and the exponential distribution when t is large. These are illustrated 
with the right tail exponential asymptotic, Eq. ( [l7| ) (dash-dotted line); the left tail 
asymptotic, Eq. (16) (dashed line), and the power law line k^t^~^ /{L — 1)1 (dotted 
line) . 



are activated before tci, that is: 

N J ^^^]^ ^^, t^^-^ exp [- (fci + 71) t] dt » 1, (20) 


where, as before, the subscript 1 stands for the correct branch. Using the standard 
expressions for the incomplete F-function, we thus derive the minimum number A^mm of 
receptors needed for variabihty reduction along the correct branch 

N » iV^in = ^ ^f^I ^- (21) 

l-exp(l-L,) 

This sets the lower bound on the number of receptors on the cell surface. However, 
if the number of receptors was too high, some of the ones bound by incorrect ligands 
would activate in the left tail of the distribution, before the kinetic proofreading has 
a chance to kick in. The requirement that the probability of early activation over the 
wrong branch among all receptors is negligible sets the upper limit on the receptor 
number: 

N J ^^^^t^^-^exp[-(A;2 + 72)t]t^t<l, (22) 
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where the subscript 2 stands for the wrong branch. This results in 

N<^N^^^ = ^ 'jj—^ (23) 

1 - exp (1 - L2) 

While we do not expect that the bounds derived from our manifestly oversimplified 
model predict the real number of receptors on the cell surface, it is still worthwhile 
to verify if the obtained bounds are meaningful, so that kinetic proofreading over a 
wrong branch can be consistent with narrow activation time over the correct one. As 
in Ref. [25], we choose L = 16, and ki = k2 = 1, the same for both the correct and 
the incorrect branches. To choose the kinetic proofreading rates 71^2? we note that they 
correspond, for example, to unbinding of a ligand from the receptor complex, so that 
the log-ratio of the rates are related to the difference of the binding free energies AGi 2 
of the correct and the incorrect ligands: 

AGi- AG2 = keTln^. (24) 

71 

Assuming the difference in the binding free energies of a few thermal energies, we get 

— ~2...10. (25) 

71 

Now choosing 71 ^ 0.7, we get the minimum number of receptors A^min ~ 10^, and 
the maximum number A^max ~ 10'' or more. The window between A^min and A^max 
is sufficiently large to allow the choice of receptor number that satisfies both the 
proofreading and the narrow activation time. Further, these numbers are realistic in 
the context of biological receptor numbers, which range between 10^ and 10^. 



4. Conclusion 



Biological systems must operate under constraints posed by the physical world. One of 
such constraints is robustness to intrinsic noise that comes about from the small number 
of stochastically activated molecular components in cellular networks. Performing 
averages of different kinds (over time, space, or different molecular species) is, essentially, 
the only way of ensuring such robustness. In this context, it has been understood for 
many years that sequential activation of L molecular components averages the times of 
the activation steps and reduces the coefficient of variation of the time to activation, 
which scales then as ~ I/VL [26]. This has been observed experimentally in the context 
of vertebrate phototransduction [22]. Here we propose a mechanism that can further 
reduce the variability to ~ '^/L, and even ~ l/Li/m (where m is the cooperativity of 
the activation process), which may be a substantial improvement for long activation 
sequences and large cooperativities. The reduction of variability is achieved by having 
many equivalent multi-step entities, competing to be the first one to activate the whole 
system. We believe that this mechanism, which we call racing to activation, has not 
been reported previously in the literature. 
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Our focus here is on the general noise suppression mechanism, and not on any 
specific biological system. Nonetheless, it is clear that various receptor-mediated 
signaling systems are likely to be examples of where this mechanism will be applicable. 
The fact that receptors exist on cell surfaces in very large numbers, far in excess of what 
is needed to fully activate a cell, is encouraging. However, the large number of receptors 
needed for reduction of the temporal noise in the racing to activation mechanism can 
come in confiict with the specificity of signaling. It is, therefore, reassuring that, within 
our model, there is a range of receptor numbers that allows cells to maintain specificity 
to certain ligands, and yet be activated precisely in time. 

Another possible application of our model is in development. There a crucial 
question is the precision of position determination afforded by morphogen gradients 
(23 EHl EH]- While most analyses are done for steady state gradients, cells may need 
to make developmental commitments based on pre-steady state transients [30] . Adding 
reverse reactions to Fig. [l](a), with rates ^reverse < ^, our model can be interpreted as 
a biased (nonequilibrium) random walk of a morphogen particle in the physical space, 
where the first m of such particles that reach a morphogen detector at L activate it. 
Our arguments suggest that such temporal, first-passage triggering of developmental 
commitments can be made very precise. 

While too coarse for specific details, our results can be used to make general, 
verifiable predictions about relations between the number of receptors, the similarity of 
specific and non-specific ligands, the cooperativity of receptor action on the one hand, 
and the variability of the activation time on the other. Such data, when available, will 
shed light on whether the proposed mechanism is used in cellular processes, and to 
which extent. 
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